Robotic Knee Tracking Control to Mimic the Intact Human Knee Profile Based on Actor-Critic Reinforcement Learning
نویسندگان
چکیده
We address a state-of-the-art reinforcement learning (RL) control approach to automatically configure robotic prosthesis impedance parameters enable end-to-end, continuous locomotion intended for transfemoral amputee subjects. Specifically, our actor-critic based RL provides tracking of knee mimic the intact profile. This is significant advance from previous automatic tuning which have centered on regulation with designer prescribed profile as target. In addition presenting complete algorithm direct heuristic dynamic programming (dHDP), we provide an analytical framework controller constrained inputs. show that proposed possesses several important properties, such weight convergence networks, Bellman (sub)optimality cost-to-go value function and input, practical stability human-robot system under input constraint. further systematic simulation using realistic simulator, OpenSim, emulate how dHDP enables level ground walking, walking different terrains at paces. These results not only theoretically suitable, but also practically useful.
منابع مشابه
Dynamic Control with Actor-Critic Reinforcement Learning
4 Actor-Critic Marble Control 4 4.1 R-code . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 4.2 The critic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 4.3 Unstable actors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 4.4 Trading off stability against...
متن کاملActor-Critic Reinforcement Learning with Simultaneous Human Control and Feedback
This paper contributes a first study into how different human users deliver simultaneous control and feedback signals during human-robot interaction. As part of this work, we formalize and present a general interactive learning framework for online cooperation between humans and reinforcement learning agents. In many humanmachine interaction settings, there is a growing gap between the degrees-...
متن کامل1 Supervised Actor - Critic Reinforcement Learning
Editor’s Summary: Chapter ?? introduced policy gradients as a way to improve on stochastic search of the policy space when learning. This chapter presents supervised actor-critic reinforcement learning as another method for improving the effectiveness of learning. With this approach, a supervisor adds structure to a learning problem and supervised learning makes that structure part of an actor-...
متن کاملActor-Critic Reinforcement Learning with Energy-Based Policies
We consider reinforcement learning in Markov decision processes with high dimensional state and action spaces. We parametrize policies using energy-based models (particularly restricted Boltzmann machines), and train them using policy gradient learning. Our approach builds upon Sallans and Hinton (2004), who parameterized value functions using energy-based models, trained using a non-linear var...
متن کاملCohesion-based Online Actor-Critic Reinforcement Learning for mHealth Intervention
In the wake of the vast population of smart device users worldwide, mobile health (mHealth) technologies are hopeful to generate positive and wide inuence on people’s health. ey are able to provide exible, aordable and portable health guides to device users. Current online decision-making methods for mHealth assume that the users are completely heterogeneous. ey share no information among ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE/CAA Journal of Automatica Sinica
سال: 2022
ISSN: ['2329-9274', '2329-9266']
DOI: https://doi.org/10.1109/jas.2021.1004272